I had the great experience of presenting at EuroPython 2016. My talk entitled “Clean code in Python”, was about good development practices, down to low-level design (with code examples), for Python. The idea of the talk, was to present the “pythonic” approach for writing code, and how do general concepts of clean code apply to Python.

These examples might be useful for beginners, developers experienced in other languages coming to Python, and people using Python for scientific applications. The examples could also be helpful for senior developers, because they remind real situations that might appear in pull requests, while doing a code review.

Here is the video on YouTube:

And the slides (which I also made available along with the source code, shortly after the presentation finished).

The presentation was well received: some attendees told me they liked it (even asked for the slides and code), and I got good advices. The following days of the conference, more people told me that they liked the presentation, and some others mentioned (something I did not think at the beginning, but that it makes perfect sense), that these ideas are really useful for people using Python in scientific environments.

I am glad it was useful for the community.

EuroPython 2016 remarks

Mariano Anaya

Sun Jul 31, 2016 - 12:06 -0300

Last week, EuroPython 2016 finished, and it was an amazing conference I had the pleasure to attend. Here is my review of those days.

The conference

I arrived on Saturday noon at Bilbao, Spain, the day before the conference, so I had some time to know the city, see the venues, etc. The next day, on Sunday, was for two separate workshops: Django girls and Beginner’s day. I attended the beginner’s day as a coach, and helped a group of intermediate developers with several exercises aimed at explaining some Python concepts, such as: context managers, decorators, magic methods, generators, etc. It was really curious that some of these topics were those I was going to cover on my talk on Wednesday, so I felt really glad about that. I took an oath (very funny BTW) for becoming a beginner’s mentor, and so I did (it was really good actually). I had a great time helping other developers, exchanging ideas and experiences during lunch, solving problems, and getting a first glimpse on what the conference was going to be like.

The moment of the oath for becoming a mentor, and earning the badge.

After the workshop finished, I walked to the main venue, and gave a hand packing the bags of the conference. After that, time to see around Bilbao.

From Monday to Friday was the conference itself, with all the talks, and trainings.

Monday started with the introduction to the conference, and shortly thereafter, the very first keynote by Rachel Willmer, who gave a great presentation, sharing a lot of experience, and interesting ideas.

At around noon there was a keynote by N. Tollervey about MicroPython. The presentation was excellent (one of the ones I liked the most), and the idea of the project is awesome. On top of that, it was announced that the BBC was giving away micro:bits for the attendees of the conference, so it was a great surprise to pick up mine at the conference desk. I even started playing around a bit with it (more in a future post).

The rest of the afternoon, I attended several talks. At the end, there were, of course the lightning talks, which were amazing.

Tuesday started with the keynote by P. Hildebrant, presenting how Disney uses several technologies, including Python, as support for movies and productions. It was very good and enlightening to see an endeavour of such extent with Python. After that, during morning I attended a workshop about Async web development, with several Python technologies for doing asynchronous computation.

During the afternoon, I watched several great talks, including “Protect you users with Circuit Breakers”, and several other good ones, closing with the lightning talks.

Wednesday was the day of my talk, so I attended some talks during morning and then, at the afternoon, I presented mine. I really liked how it developed. Moreover, it was really good to receive good feedback from some attendees, saying they liked it, and that it was useful for them. Shortly thereafter, I published the slides and the source code.

On Thursday, there were some talks about async/await and asynchronous programming in Python 3, mocks, and high-availability architecture.

On Friday, the keynote was about how Python is used by the scientific community. It was very enlightening, and interesting to see another use case of Python, and how is becoming the main technology on this area.

The talks during morning in this case, were divided among several topics, being the main ones: instrumentation for performance metrics, “How to migrate form PostgreSQL to HDF5 and live happily ever after”, “Split Up! Fighting the monolith”. During the afternoon, I joined a workshop about Docker, on which we built an application using Docker-combine, and followed good practices.

It is worth mentioning, that on Friday there was an special edition for lightning talks, which was not in the original schedule. After making some arrangements, and due to some on-the-fly changes, it was possible to have another session for lightning talks, right before the sprints orientation and the closing session.

Saturday and Sunday were for sprints (hackathons). On Saturday I joined to sprint on aiohttp, and actually submitted a pull request, that was merged, whereas on Sunday I wanted to check on a pytest issue.

Sprints @europython! pic.twitter.com/PVQZw9rSvV
— PySS Society (@acpyss) July 23, 2016

My talk

It was great to have the opportunity to present at EuroPython. What was even better, was the positive feedback I got from other attendees, and the fact that it was useful and interesting for them (which was, in the end, what I cared most about). I found the experience very positive.

From the comments, I gathered something I have not noticed when I first envisioned the talk, which is how useful these concepts might be for people using Python for scientific applications. It seems, scientists using Python for data processing or computation, do not usually have the background of a developer, so concepts like code readability, technical debt, and maintainability, are helpful in order to improve the code base. This gave me the idea of adapting the examples, perhaps adding one related to these areas.

Python use cases

There were people from many countries, industries, and companies with different backgrounds. The trend seems to be now on data science, but Python is widely used in many areas.

I believe the main areas of focus for Python are: software development, system administration / Dev Ops, and science.

There were talks, tracks, sessions, and trainings for all of them, with very technical detail.

Highlights

There were so many great talks and resources that I cannot name each single one of them, so I will point the main topics and some of the talks that grabbed my attention the most, but please keep in mind that all were great.

Among the many things pending to test and research, are also books. I learned about PYRO4, for managing Python remote objects, which seems like a promising technology. I will dive into more detail on conda and the building systems, conda channels, etc. The talk “Exploring your Python interpreter” was really interesting, and it was a good introduction, in order to become involved with CPython development.

I attended many talks about the latest features of Python 3.5, such as asyncIO, coroutines, and all the new functionalities for asynchronous programming, and they all were really interesting. In particular “The report of Twisted’s Death” was very interesting, and (spoiler alert), it looks like still has an interesting future competing with the new libraries and standards.

On the lightning talks, it was presented a reverse debugger (revdb), and its demo was amazing.

Conclusion

After attending many talks, and trainings, talking to many other experience developers, system administrators, and data scientists, I can state that the conference has an amazing learning environment, and the outcome was completely positive. It was useful for catching up with technology, checking the environment and see how Python is being used or deployed in the wild, learn from use cases, experiences, and exchange ideas.

The content was really inspiring and open-minding. I have lots of items to check, as points for research, which I will cover in following entries.

Python 3 is much more widely used than one would expect. It is actually the standard now, and many talks (including mine), were using Python 3 code, but most importantly, most projects are now in this version, whereas Python 2 looks like the legacy option. Good news :-)

All in all, this edition of EuroPython was awesome, and I am looking forward to presenting again next year!

Upcoming talk at EuroPython 2016

Mariano Anaya

Sun Mar 27, 2016 - 16:37 -0300

I am glad to inform that I will be speaking at EuroPython 2016 conference.

I’ll speak @europython 2016:
‘Clean code in Python’ https://t.co/e4clWU3aDM
— Mariano (@rmarianoa) March 27, 2016

My submission about clean code in Python was accepted, so in the next edition of EuroPython 2016, in Bilbao, Spain, I will talk about clean code principles for Python. Here is the abstract:

https://ep2016.europython.eu/conference/talks/clean-code-in-python

The full list of talks is available at:

https://ep2016.europython.eu/en/events/sessions/

If you are interested, subscribe to the EuroPython blog and Youtube channel. I will include more details in a separate post.

Glimpses of a Vim configuration

Mariano Anaya

Sat Feb 27, 2016 - 13:06 -0300

It’s been a while since I started tracking versions of my custom Vim configuration, and making it available as an open source software in Github. The best of this project is, in my opinion, to have it under version control, so I can track changes and releases.

Every once in a while, when I find a new setting, or a great new feature, I modify the configuration, so they will become available on the next release. Besides the features that are mentioned in the project, and the customizations made, I feel very comfortable with the colour scheme I made.

Here are some glimpses of it:

First capture of colours, and layout

The colour scheme is general for the syntax highlighting of all types recognized by Vim. Please note this might also depend on the configuration of your terminal.

The tabs are also themed according to the menus.

Any suggestions or improvements to the code and configuration can be made on the Github project.

Deleting commented out code

Mariano Anaya

Wed Feb 17, 2016 - 22:06 -0300

This is a rule I always encourage in software development. Moreover, I consider it to be something that has to be included in every coding guideline of a good project.

There are several reasons for this, but probably the best explanation can be found in the book “Clean Code”1 by uncle Bob, on which explains that the code, gets outdated (rotten) with the rest of the surrounding code, and hence it makes a place for confusion, leading to an error-prone spot.

There are, however, people that seem to find some arguments for commenting out code, or leaving it. Some common arguments/reasons usually are:

“I might need this functionality later..”

We have source control systems (for example git) for this. In git, anything can be restored from a previous point. If the software is properly under version control, there is no reason to fear data loss. Trust git, code fearlessly.

“This is temporary disabled… It will be restored later”.

Again, same principle, rely on the version control system. Save a patch, and then restore later, or stash the changes, revert the commit, etc. As you see, there are plenty of better options for solving this scenario.

Code that was left from the fist version

Probably debugging leftovers. No doubt here: seek, locate, destroy.

There is, a clear problem with code that is under comment, which is that it is “frozen” in time: it was good at some point, but then it was left there while the rest of the code around it, evolved, so this old code might not certainly work (hence it is “rotten”), so un-commenting it is a bad idea because it will probably crash.

Another problem is that it can be a source of bias for some other developer, who wants to maintain that code at a future point in time. The one who left the rotten code, might have thought that it was a source of inspiration for when this functionality was going to be applied, but instead, it is just biasing the new developer with this skeleton, preventing from a brand new, fresh idea for that code.

Therefore, for these main reasons (an probably much more), having code that is commented in a code base, is a poor practice, (not to mention a code smell). If you are a seasoned developer, who cares about code quality, and best practices, you must not doubt when deleting it. Delete commented out code mercilessly: seek, locate and destroy.

1: A book I highly recommend: https://www.amazon.com/Clean-Code-Handbook-Software-Craftsmanship/dp/0132350882

strncpy and strncat for copying strings in C

Mariano Anaya

Sun Sep 27, 2015 - 18:34 -0300

Recently, I’ve read an interesting article 1, explaining why the strncpy function in C is not safer than strcpy. The post was very interesting, but what’s more, it suggested an alternative idiom for copying strings in C, that might probably be the way to go.

Later, in another article 2 that compared some functionality in C, Python and Go, one of the comments pointed out that very same idiom. That grabbed my attention, so I decided to try it in an example.

The problem with strncpy seems to be the way it manages the source string to be copied. Based on the sample code provided in the documentation 3 (that should be just a reference), the break condition is up to n characters (the third parameter) or until the source string is exhausted, whatever happens first. This should not be a problem, unless n < strlen(source_string). That parameter would make strncpy to finish before it can put a \0 character at the end of the target string, leaving an invalid array of characters 5.

This is an example.

	`#include <stdio.h> /* stdout, fprintf */`
	`#include <string.h> /* strncpy, strlen */`

	`#define TARGET 10`

	`int main(int argc, char* argv[]) {`

	`char src = "Castle"; / 6 chars long */`
	`char dst[TARGET] = "__________";`

	`dst[TARGET - 1] = '\0';`

	`fprintf(stdout, "%s\n", dst); /* must be: '_________' */`
	`/* What happens if I pass a wrong length (lower than the actual`
	`* strlen) */`
	`strncpy(dst, src, 3);`
	`fprintf(stdout, "%s\n", dst); /* must be: 'Cas______' */`
	`/* If I copy the string correctly, by passing the right`
	`* length, then strcnpy behaves as expected */`
	`strncpy(dst, src, strlen(src) + 1);`
	`fprintf(stdout, "%s\n", dst); /* must be: 'Castle' */`

	`return 0;`
	`}`

On this example, the target array is represented by the variable dst, and I used a fixed-length string, on purpose for the demonstration, simulating what would actually happen. I null-terminated it so the program can finish successfully, because otherwise the operations on it would not end until the delimiter is reached, and we cannot know when that will happen, considering what’s in memory at that time. In addition, the unpredictable behaviour will lead to errors, and probably to memory corruption. The underscore, should be interpreted as slots: regions or reserved memory that are there, but empty.

The proposed idiom uses strncat (see 4), tricking the function by passing it an empty string as the first parameter, and then the actual string we need to copy. This call will render the same result, but without the previous side effect. Let’s see an example:

	`#include <stdio.h> /* stdout, fprintf */`
	`#include <string.h> /* strncat, strlen */`

	`#define TARGET 10`

	`int main(int argc, char* argv[]) {`

	`char src = "Castle"; / 6 chars long */`
	`char dst[TARGET] = "__________";`
	`dst[TARGET - 1] = '\0';`

	`fprintf(stdout, "%s\n", dst); /* must be: '_________' */`
	`/* Prepare destination string */`
	`dst[0] = '\0';`
	`/* Copy with strncat */`
	`strncat(dst, src, 3);`
	`fprintf(stdout, "%s\n", dst); /* must be: 'Cas' */`
	`/* If I copy the string correctly, by passing the right`
	`* length, then strcnpy behaves as expected */`
	`dst[0] = '\0';`
	`strncat(dst, src, strlen(src) + 1);`
	`fprintf(stdout, "%s\n", dst); /* must be: 'Castle' */`
	`/* If I try to overrun the buffer */`
	`dst[0] = '\0';`
	`strncat(dst, src, strlen(src) + 10);`
	`fprintf(stdout, "%s\n", dst); /* must be: 'Castle' */`

	`return 0;`
	`}`

Here we see, the error is no longer present, probably because of the difference on the implementation (the snippet on the documentation 4 gives us a hint on what it does, so we can spot the change).

This might seem as a little issue, but it raised some concerns on the Linux kernel development, at the point that a new function was developed. The strscpy function is being included in the Kernel development for Linux 4.3-rc4 6 because it is a better interface. Some of the problems mentioned in the commit message, that inspired this new version, are the ones described on the previous paragraphs.

This makes me wonder, if this should be the “correct” way for performing this operation “safely” in C. In all cases, the error is the same (not checking the boundaries, and trusting the input), and should be avoided. What I mean by this, is that we cannot simply rely on those functions being secure, the security must be in our code, so the proper way to handle these situations is to code defensively: do not trust user input, always check the boundaries, error codes, memory allocation, status of the pointer (a free for every malloc but not for a NULL pointer, etc.).

1: https://the-flat-trantor-society.blogspot.com.ar/2012/03/no-strncpy-is-not-safer-strcpy.html.
2: https://blog.surgut.co.uk/2015/08/go-enjoy-python3.html.
3: strncpy documentation.
4(1,2): strncat manual page.
5: An array of characters that is not null-terminated, is invalid.
6: https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/?id=30c44659f4a3e7e1f9f47e895591b4b40bf62671.

Setting user permissions in KVM

Mariano Anaya

Sat Jun 20, 2015 - 20:46 -0300

In a previous article I mentioned how to install a library in Fedora, in order to make KVM virtualization easier, managing the NAT network configuration between the guest virtual machine and the host, by means of libvirt.

Besides that, while using KVM locally for development, I use virt-manager, a helpful application that manages the different virtual machines. This application, as well as the rest of the commands that interact with libvirt (virsh for example), require super user privileges, so it will prompt for the sudo password every time.

This can be avoided by including the user into the following groups: kvm, and libvirt.

Therefore, just by running the following command we can skip the password prompt every time.

sudo usermod -a -G kvm,libvirt mariano

This is an option I would use only for local development on my machine. Productive environments must have an strict permissions management.

Notas sobre la ArqConf 2015

Mariano Anaya

Fri May 01, 2015 - 17:45 -0300

Este es mi resumen sobre la ArqConf 2015, la conferencia sobre arquitectura de software que tuvo lugar en la UCA el 30 de Abril de 2015. La idea es sintetizar las principales ideas que me llevé y resaltar lo más importante.

Se presentan a continuación un listado de las ideas principales por charla con un breve listado de lo que más destaco por cada uno. Nótese que la lista no es de ninguna manera exhaustiva, y cada sección es en realidad un breve párrafo ilustrativo a modo de resumen muy a alto nivel.

Cada sección lleva un título alusivo al tema de la presentación, un breve resumen y una lista con los principales puntos que destaco.

Orden en una arquitectura y la agilidad como atributo de calidad

Se basó en la experiencia de un arquitecto liderando un equipo de arquitectura para un desafiante proyecto. El disertante explicó los problemas a resolver, y el marco tecnológico en el que se desarrolló la solución, y cómo con una arquitectura elegante y simple con relativamente pocos componentes se puede llevar a cabo una implementación de gran porte que de soporte a 150.000 transacciones concurrentes.

El factor clave del éxito de una arquitectura es la comunicación.
La agilidad de un equipo como atributo de calidad. Es interesante, porque cuando uno piensa en atributos de calidad se le ocurren cosas como seguridad, mantenibilidad, usabilidad, escalabilidad, etc. pero no el hecho de ser ágil. Sin embargo es en general deseable que el equipo sea ágil y se pueda adaptar fácilmente a los cambios, y en ese caso ¿Por qué no agregarlo como un atributo de calidad?
La flexibilidad del equipo, también como atributo de calidad. Análogo al anterior, pero con un detalle: si es un atributo de calidad, tiene que ser medible. Lo interesante no es sólo la originalidad de este tipo de atributos de calidad “no tradicionales”, sino también en como considerarlos dentro de los escenarios de calidad.
Los requerimientos deben priorizarse en el marco global de la organización.

Arquitectura y Big Data Analytics.

Una presentación excelente, con mucho detalle y riqueza técnica, nombrando tecnologías, metodologías, técnicas, y estilos de arquitectura orientados a Big Data.

Kafka como herramienta para procesamiento de información en colas de mensajes no tradicionales (con estado persistente). Es un ejemplo de una tecnología que tuvo un caso de éxito real en proyectos de Big Data.
Siempre guardar la fuente de datos: llamada “la fuente de la verdad” (the source of truth), es una buena práctica, ya que tiene varias ventajas, como por ejemplo:
- Permite corregir errores en caso de falla (a partir de los datos, se puede volver a procesar y no hay una pérdida irrecuperable de información).
- Preservando los datos originales (raw data), es posible en un futuro elaborar o calcular nuevas métricas si se requieren, cosa que si por el contrario sólo se guardaran los datos procesados, sería imposible.
- El costo extra por el almacenamiento no debería ser un problema, considerando los beneficios.
Procesar la información de forma idempotente: esta quizá sea la idea que mejor refleja una buena práctica general, no solo aplicable a Big Data. En lugar de procesar modificando registros (por ejemplo ejecutando un SQL que sume uno en alguna columna), simplemente se agregue una nueva entrada y luego el resultado se calcule sobre el total. De esta manera no se modifican los datos, y de nuevo, un potencial error es reparable, no hay pérdida irreversible de información, etc. Ésta en realidad es una idea que ya existía en sistemas de BI, pero es interesante notar que registrar los hechos se puede usar para muchos más casos.
Simplificar las variables tecnológicas. En lugar de tener un extenso repertorio tecnológico con muchas tecnologías de propósito específico, es mejor y más fácil de mantener un entorno con menos tecnologías, y, aunque estas no se adapten perfectamente a cada problema en particular, aún así hay que privilegiar el pragmatismo, haciendo los ajustes necesarios.
Tener un esquema de datos (data schema) para poder integrar la información que se procesa desde diferentes fuentes.

Arquitecturas de Micro servicios.

Es muy interesante escuchar sobre los micro servicios, y cómo este tipo de arquitecturas permiten una escalabilidad más flexible.

Las arquitecturas de micro servicios permiten obtener la misma funcionalidad, pero de forma distribuida, en contraposición a lo que sería una arquitectura monolítica.
Esto permite escalar de forma más flexible, por ejemplo se pueden administrar los subsistemas de forma independiente, asignando los recursos o manteniendo más componentes pero más simples.
Esta separación también puede reflejarse en equipos de trabajo, áreas o procesos.

Arquitectura y métodos ágiles

En esta ocasión, se habló de la arquitectura de software desde el punto de vista de las metodologías ágiles y los procesos de desarrollo alineados a los requerimientos funcionales del negocio.

El equipo puede conversar la arquitectura en función de los requerimientos con el PM, sin necesariamente entrar en muchos detalles técnicos, concentrándose en la funcionalidad y comportamiento esperado.
Ésta conversación sobre la arquitectura debe ser constante a lo largo de todo el ciclo de desarrollo.

Arquitectura aplicada la producción

Excelente cierre de la conferencia. Hizo mucho hincapié en cómo se ve a la arquitectura y el rol del arquitecto o el equipo de arquitectura desde el punto de vista del CIO. Ésto dilucidó bastante sobre lo que se espera del equipo de arquitectura para que la organización funcione.

Lo más destacado fue ver qué es lo que se espera y lo que NO se espera del arquitecto, y cómo lo más importante es poder brindar una solución como ingenieros, que responda a las necesidades del negocio. La principal riqueza estuvo en que las ideas fueron ilustradas con experiencias reales en Data Centers reales.

Algo llamativo es que muchas ideas mencionadas son en realidad cuestiones que se asumen en un proyecto de software, pero como sabemos en la práctica no siempre sucede, y esto deriva en malos resultados.

La integridad conceptual es fundamental: Las soluciones deben proporcionarse de forma uniforme, aplicando sendos estilos y tecnologías para los mismos tipos de problemas. Análogamente, si para diferentes proyectos se usan muchas tecnologías diferentes, el resultado es una arquitectura gigante y muy difícil de mantener.
Cada componente técnico interno del equipo de ingeniería no es el principal objetivo de la organización, si no que están para responder a éstos.
Adoptar nuevas tecnologías solo por que presenta algunas ventajas parciales no siempre es una buena idea a largo plazo. Suele suceder que a largo plazo termina teniendo consecuencias perjudiciales para el proyecto.
Los sistemas deben diseñarse y construirse para durar varios años (~10), y esto implica que las tecnologías de construcción tienen que tener varios años de existir, de manera que sea razonable aseverar que seguirán estando disponibles el tiempo que dure el sistema productivo. No sería deseable tener que mantener o hacerse cargo de tecnologías (frameworks, toolkits, etc.) obsoletas.
Criticar las llamadas “buenas prácticas” (o verdades reveladas). Esto significa que cuando algo se denomina como buena práctica hay que plantearse si realmente es así, y aunque lo fuera, si esas ventajas que trae aplican al proyecto en cuestión. Ésta es otra idea más general, se trata en definitiva de tener pensamiento crítico, pero es algo que en muchos casos no sucede, y vemos en general varios proyectos aplicando “patrones de diseño” (o de arquitectura) o “buenas prácticas ágiles”, etc. sin pensar realmente cómo aplican al proyecto (algo puede haber dado resultados excelentes en otro proyecto, en otra empresa, en otro país, pero el arquitecto debe considerar si esas variables realmente coinciden o son relevantes al contexto).

>>> Conclusiones

Considero que la conferencia fue muy buena, teniendo en cuenta la calidad de las presentaciones, la experiencia de los disertantes y que todo estaba alienado conceptualmente, lo cual hizo que la transición entre temas tuviera una continuidad notable.

Es además importante destacar que este tipo de conferencias, además de ser enriquecer la experiencia profesional de todos (disertantes, organizadores y concurrentes), benefician a la comunidad de arquitectos.

Running RabbitMQ server on Docker

Mariano Anaya

Sun Apr 26, 2015 - 13:58 -0300

If you use RabbitMQ for development frequently, sometimes you might have found it uses too much resources (it’s normal while programming to have a lot of queues or tasks being queued and that makes the CPU usage to spike).

Having RabbitMQ installed on the OS seems the perfect approach for production, but on development I’d rather do something different, in order to isolate the process. I know I could bound it (for example order it not to start automatically as a dæmon), by means of systemd but a few weeks ago I decided to try docker and see how it results.

It turned out to be just the tool for the work, and so far with a little simple configuration it can run as expected.

There is already a docker image for RabbitMQ, which can be automatically pulled, and then run, for example:

sudo docker pull rabbitmq
sudo docker run -d -e RABBITMQ_NODENAME=my-rabbit --cpuset="1" --name docker.rabbitmq -p 5672:5672 rabbitmq:3

The -d option indicates the process to start detached, then by passing -e we pass some environment variables (in this case, the RABBITMQ_NODENAME is a particular variable for rabbit indicating how to set the name of the node it is starting). Optionally, we can limit the CPU usage with the --cpuset, as in this case which sets the process to use the second core of the machine (it starts at 0). Then the --name is a name for the docker being created.

The most important part in this case is the port mapping, made by the -p option which in this case maps the port used by RabbitMQ directly (1:1) with the host machine. This makes the docker process to run transparently, as the other applications that try to communicate with a RabbitMQ won’t notice any difference, making it look like is executing an actual RabbitMQ service. Finally there is the name of the docker image to run.

What I usually do is to manage the docker image by its instance_id (a number that is displayed after listing the docker images, by doing sudo docker ps -a). Then we can manage it by sudo docker [start|stop] <instance_id>.

There is another command to see the output being generated by the process which is docker logs rabbitmq.docker. Notice in this case the name designated to the image was used instead of the instance_id. In addition we can see internal data for the process by running the inspect command (again we can use the instance_id or the name we assigned).

docker inspect rabbitmq.docker
sudo docker logs docker.rabbitmq

It’s important to notice that docker is actually not a virtualization platform, but a mechanism that runs processes in containers, meaning that in this case the entire RabbitMQ is running as a single process within a container, with some other limitations and bounds constrained by docker.

I found this approach to be very versatile for a development environment, and with RabbitMQ being the first pilot, I think I can migrate more applications to docker instead of having them installed on the system (as long as possible).

Find Options

Mariano Anaya

Tue Mar 24, 2015 - 11:29 -0300

Among the core-utils, find is one of the most useful commands. Though I use the basic functions most of the time, find has a wide range of parameters, and it comes in handy not only for finding files, but also for operating a bunch of them at once. Here is a very simple example.

Imagine you have to move many files to a directory, but they all call different so a glob is no use, and manually moving all of them is not an option. A possible approach would be to locate the first of the batch (for example by running ls -lrth). Suppose the first one of the batch is called /tmp/checkpoint (for this example let’s assume the files reside at /tmp).

The command would be:

find /tmp -type f -anewer /tmp/checkpoint -exec mv '{}' <target_directory> \;

The -type f part is important in order not to move the entire directory (find only the files). Then we have the -anewer that receives a file as a parameter, and it will filter for those files whose modification date is greater than the file used as an example (hence, this must be the start of the batch), and finally the -exec part is interesting because as mentioned at the beginning, it allows to perform arbitrary operations on the group of files (in this case to move them to another location, but other actions such as modifications, sed, etc. are also possible).

Another trait I like about find is that presents a secure and well-defined interface, meaning that in some cases I can first check the results prior to execute an action. For example, if we would like to check for deleting some unnecessary files:

find . -name "*.pyc"

By issuing this command we list some files to erase. And then we can simply do that by appending -delete to the very same command.

This is just the tip of the iceberg of the things that are possible by means of the find command and its various options.